Search CORE

4 research outputs found

Identification of sequence motifs significantly associated with antisense activity

Author: McQuisten Kyle A
Peek Andrew S
Publication venue: BioMed Central
Publication date: 01/06/2007
Field of study

Abstract Background Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. Results We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequences antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. Conclusion The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic mediators to speed the process along like the RNA Induced Silencing Complex (RISC) in RNAi. The independence of motif position and antisense activity also allows us to bypass consideration of this feature in the modelling process, promoting model efficiency and reducing the chance of overfitting when predicting antisense activity. The increase in SVR correlation with significant features compared to nearest-neighbour features indicates that thermodynamics alone is likely not the only factor in determining antisense efficiency.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Comparing Artificial Neural Networks, General Linear Models and Support Vector Machines in Building Predictive Models for Small Interfering RNAs

Author: A Fire
A Henschel
A Khvorova
A Reynolds
AC Hsieh
AM Chalk
Andrew S. Peek
AS Peek
B Jagla
C Nadeau
C Xue
D Huesken
D Huesken
DK Walters
DS Schwarz
G Ge
H Tafer
I Bradac
I Ladunga
JP Vert
K Ui-Tei
Kyle A. McQuisten
L Poliseno
M Amarzguioui
M Ichihara
O Matveeva
P Jia
P Jiang
P Sætrom
P Sætrom
R Kretschmer-Kazemi Far
R Teramoto
RS de Almeida
S Takasaki
SA Bohula EA
SA Shabalina
SM Yiu
Stefan Wölfl
T Holen
T Katoh
TA Vickers
TG Dietterich
W Gong
ZJ Lu
Publication venue: Public Library of Science
Publication date: 01/10/2009
Field of study

Exogenous short interfering RNAs (siRNAs) induce a gene knockdown effect in cells by interacting with naturally occurring RNA processing machinery. However not all siRNAs induce this effect equally. Several heterogeneous kinds of machine learning techniques and feature sets have been applied to modeling siRNAs and their abilities to induce knockdown. There is some growing agreement to which techniques produce maximally predictive models and yet there is little consensus for methods to compare among predictive models. Also, there are few comparative studies that address what the effect of choosing learning technique, feature set or cross validation approach has on finding and discriminating among predictive models.Three learning techniques were used to develop predictive models for effective siRNA sequences including Artificial Neural Networks (ANNs), General Linear Models (GLMs) and Support Vector Machines (SVMs). Five feature mapping methods were also used to generate models of siRNA activities. The 2 factors of learning technique and feature mapping were evaluated by complete 3x5 factorial ANOVA. Overall, both learning techniques and feature mapping contributed significantly to the observed variance in predictive models, but to differing degrees for precision and accuracy as well as across different kinds and levels of model cross-validation.The methods presented here provide a robust statistical framework to compare among models developed under distinct learning techniques and feature sets for siRNAs. Further comparisons among current or future modeling approaches should apply these or other suitable statistically equivalent methods to critically evaluate the performance of proposed models. ANN and GLM techniques tend to be more sensitive to the inclusion of noisy features, but the SVM technique is more robust under large numbers of features for measures of model precision and accuracy. Features found to result in maximally predictive models are not consistent across learning techniques, suggesting care should be taken in the interpretation of feature relevance. In the models developed here, there are statistically differentiable combinations of learning techniques and feature mapping methods where the SVM technique under a specific combination of features significantly outperforms all the best combinations of features within the ANN and GLM techniques

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

IDT SciTools

Author: Andrew S. Peek
Andrey V. Tataurov
Chris A. Sailor
Hakeem G. Almabrazi
Jeffrey A. Manthey
Justin Garretson
Kent F. Pedersen
Kyle A. Mcquisten
Neil O. Mcentaggart
Richard Owczarzy
Robert B. Dawson
Yihe Wu
Yuan Lin
Publication venue
Publication date
Field of study

a suite for analysis and design of nucleic acid oligomer

CiteSeerX